Learning What Is Relevant to the E ects of Actions for a Mobile Robot

نویسندگان

  • Matthew D. Schmill
  • Michael T. Rosenstein
  • Paul R. Cohen
چکیده

We have developed a learning mechanism that allows robots to discover the conditional eeects of their actions. Based on sensorimotor experience, this mechanism permits a robot to explore its environment and observe eeects of its actions. These observations are used to learn a context operator diierence table, a structure that relates circumstances (context) and actions (operators) to eeects on the environment. From the context operator diierence table, one can extract a relatively small set of state variables, which simpli-es the problem of learning policies for complex activities. We demonstrate results with the Pioneer 1 mobile robot. 1 Introduction This paper describes a method by which a robot learns context-dependent eeects of its actions. The robot is a Pioneer 1, manufactured by Real World Interfaces, Inc., equipped with sonars and a gripper. By interacting with its environment, the robot learns a context operator diier-ence table (CODT) that relates primitive actions to changes in sensor readings. In general, the eeects of actions will depend on context. For instance, if the robot has run into an obstacle, then attempted forward movement will have little or no eeect on the values returned by forward-pointing sonars, whereas moving backward, away from the obstacle , will aaect these sonar readings. Each cell in a CODT speciies the eeects of a single action on readings from a single sensor in context. For example, when the robot is jammed against an obstacle, the left-wheel-stalled and right-wheel-stalled sensors return \true", and in this context , attempted forward movement will have no eeect on the robot's frontmost sonar, but attempted backward movement will be accompanied by an increase in the readings from that same sonar. Conditional knowledge of this sort is stored in decision trees, and the CODT contains one such tree learned for every action-sensor pair. Although CODTs represent the eeects of actions, they are not used to control the robot. Control policies are learned by another algorithm, described below. In fact, the

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visual Tracking using Learning Histogram of Oriented Gradients by SVM on Mobile Robot

The intelligence of a mobile robot is highly dependent on its vision. The main objective of an intelligent mobile robot is in its ability to the online image processing, object detection, and especially visual tracking which is a complex task in stochastic environments. Tracking algorithms suffer from sequence challenges such as illumination variation, occlusion, and background clutter, so an a...

متن کامل

Navigation of a Mobile Robot Using Virtual Potential Field and Artificial Neural Network

Mobile robot navigation is one of the basic problems in robotics. In this paper, a new approach is proposed for autonomous mobile robot navigation in an unknown environment. The proposed approach is based on learning virtual parallel paths that propel the mobile robot toward the track using a multi-layer, feed-forward neural network. For training, a human operator navigates the mobile robot in ...

متن کامل

Dynamic Obstacle Avoidance by Distributed Algorithm based on Reinforcement Learning (RESEARCH NOTE)

In this paper we focus on the application of reinforcement learning to obstacle avoidance in dynamic Environments in wireless sensor networks. A distributed algorithm based on reinforcement learning is developed for sensor networks to guide mobile robot through the dynamic obstacles. The sensor network models the danger of the area under coverage as obstacles, and has the property of adoption o...

متن کامل

A Q-learning Based Continuous Tuning of Fuzzy Wall Tracking

A simple easy to implement algorithm is proposed to address wall tracking task of an autonomous robot. The robot should navigate in unknown environments, find the nearest wall, and track it solely based on locally sensed data. The proposed method benefits from coupling fuzzy logic and Q-learning to meet requirements of autonomous navigations. Fuzzy if-then rules provide a reliable decision maki...

متن کامل

Effect of levels of realism of mobile-based pedagogical agents on health e-learning

Background: One of the ways for effective communication between learners and instructional multimedia content in mobile learning systems is taking advantage of characters or pedagogical agents. The present study aimed to investigate the effect of the levels of realism in mobile-based pedagogical agents on health e-learning. Methods: The s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998